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Abstract 

When the available statistical information is imperfect, it is dan- 
gerous to follow standard optimisation procedures to construct an op- 
timal portfolio, which usually leads to a strong concentration of the 
weights on very few assets. We propose a new way, based on gener- 
alised entropies, to ensure a minimal degree of diversification. 
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One of the major problem in portfolio optimisation and diversification is 
that the optimal solutions (for example in the risk/return sense of Markowitz 
[|I|]) are very often concentrated on a few assets, in contradiction with the very 
idea of diversification. This feature is considered as unreasonable by most 
practioners, who feel that it is unwise to discard assets based on the fact 
that their past performance (or their anticipated performance) is somewhat 
weaker than those spotted out by the optimisation. Furthermore, due to 
the instability of the covariance matrix and average return in time, the few 
assets retained in the allocation tend to evolve with time; a situation which 
is not satisfactory from an intellectual viewpoint, and furthermore induces 
large costs. A way out is to impose to the optimisation procedure to remain 
within preassigned upper and lower bounds for each asset; however, this 
corresponds, to a large degree, to choosing one's target portfolio, since many 
assets will have weights equal to the minimum allowed weight. 

The aim of the present paper is to introduce a family of 'diversification 
indicators' which measures how concentrated is the allocation. The opti- 
misation can be performed with a constraint which precludes a too strong 
'localisation' of the weights on a few assets. Perhaps more importantly, we 
relate these indicators to the information content of the portfolio, in the 
sense of information theory. A strongly concentrated portfolio corresponds 
to a large information content, while equal weights to all assets corresponds 
to a minimal information content. We argue that the rationale behind try- 
ing to keep a balanced portfolio is that its information content cannot be 
larger than the information available to the fund manager. This intuitively 
makes sense: suppose that the average returns and covariances are perfectly 
known and that one accepts the risk/return framework. The choice of an 
optimal portfolio on the efficient border is then totally justified, even if the 
weight of a single asset was 100%. The problem is that one usually has only 
partial information: past data have a finite length, therefore the determina- 
tion of statistical parameters is imperfect; predictions of future volatilities 
and returns are obviously not entirely trustworthy - this is why the optimal 
portfolio must refiect this lack of information and keep a minimal degree of 
diversification. This can be implemented using the idea of a 'free-utility' 
function, in analogy with the free-energy in thermodynamics. 

Let Pi, i = 1,...,M be the weights of asset i in a portfolio, such that 
J2iPi = 1- The problem of characterising how uniform (or how concentrated) 
these Pi are has been investigated in a very different context, namely that 
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of the phase space of disordered systems 0. The following quantities were 
studied: 

i 

Obviously, for g = 1, Yi = 1. Suppose now that all weights are equal to 
1/M, where M is the total number of assets. Then Y2 = 1/M and thus 
goes to zero for large M. On the other hand, if one of the asset has weight 
p and all the others weight (1 — p)/{M — 1), one finds Y2 = p"^ + 0{1/M), 
which thus remains finite for large M. Y2 (and similarly for all Yg, q > 1) 
thus distinguishes 'localised' situations, where one, or a few assets gather all 
the weight, from 'delocalised' configurations, corresponding to a high degree 
of diversification. Actually, Y2 is nothing but the average weight of each 
asset. We can thus define an effective number of assets in the portfolio as 
Mes = 1/^2, and look for optimal portfolios with Mcs greater or equal than 
some minimal value Mq. Let us sketch the calculations within a classical 
Markowitz scheme. 

One first checks whether the usual Markowitz solution satisfies the min- 
imum Mgfj constraint; if not, one then looks for solution constrained to 
Mefj = Mq. Introducing the covariance matrix Cij and the expected return 
vector Tj, minimisation of the variance of the portfolio for a given average 
return and for a fixed effective number of assets leads to: 



_d_ 

dpi 



jk j j j 



(2) 



or, in matrix notation: 



p = ^C-'[Xr + (3) 

with Cij = Cij + uSij. The three Lagrange parameters A, /i, z/ are determined 
as to satisfy the three constraints (sum of weights normalised to 1, fixed 
average return R and fixed M^s = 1/^2)- This generates a family of 'sub- 
efficient border' which is drawn in together with the usual efficient border, 
which corresponds to z/ = (no constraint on Y2). Technically, Eq. ^requires, 
as in the usual case, the inversion of a matrix which is the covariance matrix 
plus a constant along the diagonal; this constant must be adjusted to satisfy 
the constraint. Optimisation with a fixed Yg with g 7^ 2 is of course possible, 
although the calculations are somewhat more complex. 
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The relation with information theory is based on the fact that the indica- 
tors Yq are actually generalised entropies 0. Indeed, the classical definition 
of the entropy (or missing information) associated to the weights Pi is 



(4) 



which is zero if the portfolio is concentrated on one asset, and maximum for 
an uniform portfolio {pi = 1/M). It is easily checked that S can be expressed 
in terms of the Y^'s as: 



All the Yg's play a role similar to the entropy, and can be used to limit the 
intrinsic informational content associated to the very choice of the weights: 
the larger the available information, the larger the allowed values of Yg. It is 
clear that in the absence of information, the only rational choice is Pi = l/N; 
while complete information corresponds to z/ = 0. It is interesting to note an 
analogy with thermodynamics: at zero temperature (absence of uncertainty), 
one must minimise the energy, which is the analogue of (minus) a utility 
function. At nonzero temperature, one must minimise a 'free-energy' which 
has both an energetic and entropic contribution. Similarly, one can introduce 
a general 'free-utility' J^q which contains an entropic contribution: 



where U is a utility function. The above optimisation, Eq. ^ corresponds 
to U = XR — 0"^, where cr^ is the variance of the portfolio, u plays the role 
of a 'temperature', which is high when the uncertainty is large. It is obvious 
that if A = (no constraint on the average return), one finds Pi = 1/M in 
the limit where u — > oo. Note that the dimension of u is the same as that of 
0"^. It could be interesting to work in a 'canonical' ensemble where u, rather 
than Y2, is fixed. A natural choice for 1/ would then be the squared error in 
the average returns rj. 

In summary, we have argued that optimisation under incomplete informa- 
tion should be performed with constraints on generalised entropies (which are 
actually diversification indicators), much like in standard thermodynamics. 
This generates new sub-efficient borders, and allows one to avoid undesired 



J^q = U-iy 




(6) 



q-l 
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overconcentration of optimal portfolios one very few assets, in accord with 
general market practice. This idea is not restricted to the Markowitz optimi- 
sation scheme, and is useful also when the risk is measured, not as a variance, 
but as the Value-at-Risk, which in a non-Gaussian world leads to different 
optimal portfolios 0. 
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Figure 1: Example of a standard efficient border i> = (thick line) with 
four risky assets. If one imposes that the effective number of assets is equal 
to 2, one finds the sub-efficient border drawn in dotted line. The inset shows 
the effective asset number of the unconstrained optimal portfolio (z/ = 0) as 
a function of average return. The optimal portfolio satisfying M^s > 2 is 
therefore given by the standard portfolio for returns between ri and r2 and 
by the M^s — 2 portfoho otherwise. 
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